An improved extrinsic monolingual plagiarism detection approach of the Bengali text

نویسندگان

چکیده

Plagiarism is an act of literature fraud, which presenting others’ work or ideas without giving credit to the original work. All published and unpublished written documents are under cover this definition. Plagiarism, increased significantly over last few years, a concerning issue for students, academicians, professionals. Due this, there several plagiarism detection tools software available detect in different languages. Unfortunately, negligible has been done no Bengali language where one most spoken languages world. In paper, we have proposed tool that mainly focuses on educational newspaper domain. We collected 82 textbooks from National Curriculum Textbooks (NCTB), Bangladesh, scrapped all articles 12 reputed newspapers compiled our corpus with more than 10 million sentences. The method text shows accuracy rate 97.31%

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Monolingual and Crosslingual Plagiarism Detection

Automatic plagiarism detection considering a reference corpus compares a suspicious text to a set of documents in order to relate the plagiarised fragments to their potential source. The suspicious and source documents can be written wether in the same language (monolingual) or in different languages (crosslingual). In the context of the Ph. D., our work has been focused on both monolingual and...

متن کامل

A Pairwise Document Analysis Approach for Monolingual Plagiarism Detection

The task of plagiarism detection entails two main steps, suspicious candidate retrieval and pairwise document similarity analysis also called detailed analysis. In this paper we focus on the second subtask. We will report our monolingual plagiarism detection system which is used to process the Persian plagiarism corpus for the task of pairwise document similarity. To retrieve plagiarised passag...

متن کامل

An Effective Approach for Compression of Bengali Text

In this paper, we propose an effective and efficient approach for compressing Bengali Text. This paper focuses on a methodical study on Bengali text compression techniques. The main target of this research is to provide a framework for Bengali text compression; which ensures a simple and computationally inexpensive effective scheme for Bengali text compression. The proposed Bengali text compres...

متن کامل

A Novel Approach for Plagiarism Detection in English Text

Digitalization provides text easily available on web interrelated to several academic areas. So it becomes a serious problem for academic enterprises or institutes. This paper presents Plagiarism detection system for the English language. Digital World provides text easily available on web interrelated to several academic areas. So it becomes a serious problem for academic enterprises or instit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Power Electronics and Drive Systems

سال: 2023

ISSN: ['2722-2578', '2722-256X']

DOI: https://doi.org/10.11591/ijece.v13i4.pp4256-4267